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environment. Information regarding these replicated modifications (e.g., insert, delete, or update) are contained in a set of replication 
tables. Thus, modifications can be duplicated at other sites immediately after the original modification, or deferred until the remote site is 
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METHOD AND APPARATUS FOR DATA REPLICATION 

BACKGROUND OF THE INVENTION 

1. FIELD OF THE INVENTION 

5 • , ; 

This invention relates to the field of data replication. 

2. BACKGROUND ART 

10 Because of the ability of computerized systems at one location, to 

communicate, with other locations, computer applications are increasingly 
accessing data located on multiple, local and remote systems. In a read-only 
mode (i.e., no data modifications), multiple copies of the same data item can be 
located at multiple sites without raising any data integrity issues. However, as 

15 multiple users resident at multiple system locations begin to modify one or 
more of the copies of the same data items, data integrity becomes a critical 
issue. 

Ideally, any user should be able to modify any copy of a data item with 
20 an ability to automatically propagate the modification to each copy of the same 
data item at any location. Prior art systems fall short of providing this type of 
peer-to-peer environment. 

For example, some systems provide one "master" copy with multiple 
25 "slave" copies. Any modifications are performed on the "master," with the 
"slave" locations receiving a copy of the modified data after the modification is 
completed on the "master." Thus, a user at a "slave" location must access the 
"master" copy to modify a data item. This technique does not provide the 



, WO 95/08809 PCT/US94/1009J, 

-2- ." 

ability to update any copy, and propagate the changes performed on that copy 
to all other copies. , • 

For example, Cordi et al„ U.S. Patent No. 4,077,059, discloses a multi- 
• • • 
5 processing system with a hierarchical memory having journaling ftfid 

copyback. The hierarchical memory system has two memory units on each 

level. One main unit contains all of the data for the level, and the other unit, 

the copyback unit, contains the changes that have been made to that d&ta' either 

by addition or modification. The main unit interfaces with the next higher 

10 level in the hierarchy and its processing unit. The copyback unit transfers the 
data changes to the lower level when the lower level's main unit is not 
interfacing with its next higher level or processing unit. The copyback unit is 
smaller than the main unit to reduce the necessary storage units on each level. 
Since the copyback unit is smaller than the main unit, there is a possibility that 

15 the number of changes to the main unit's data may exceed the capacity of the 
copyback's store. A monitoring routine is used to assure that the number of 
changes did not exceed storage capacity of the copyback unit. , Appropriate 
measures are taken to reduce the number of changes in the copyback store 
when the copyback store reaches capacity. 

20 

Schmidt et al., U.S. Patent No. 4,558,413, discloses a management system 
for managing versions of software updated and stored in designated storage 
devices in a distributed software environment of a local area network. The 
system collects and recompiles versions of a software object (i.e., source and 
25 object modules) located on these storage devices in the LAN. The compiled 
program is used in the distributed software environment. The system 
includes the associated software object's: version, unique name, update 
chronology, dependencies on (and interconnections with) other software 
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objects, and residence. The management system is automatically notified 
when changes are being made to a software object. 

Haas et al., U.S. Patent No. 4,631,673, discloses a method for refreshing 
5 multicolumn tables in a relational database. Haas provides a method for 
refreshing the snapshot (i.e., a read-only copy of a base table portion). Each 
record of a base table must have: (1) a tuple (i.e., unique) identifier , TID, ,(2) the 

previous records TID, PREVTID, and (3) "time stamp," UID, of the records last 

* * 

alteration. The snapshot contains the TID of the corresponding base table 
10 record. At refresh, the snapshot site sends the highest UID seen by the 

snapshot. The base table site identifies alterations based on TID, PREVTID, and 
UID values. Haas describes a master-slave environment whereby changes to a 
master are propagated to the replicas. 

« 

15 Kendall. U.S. Patent No. 4,635,189, describes a real-time distributed 

database management system that stores in a processor's memory copies of the 
variables needed to run the programs in that processor. When a variable is 
created, a processor is designated as the processor that determines the value of 
that variable. Each variable copy is updated by the current value of the original 

20 value on a periodic basis or upon the occurrence of a defined condition. 

Kendall describes a method of data manipulation such that a first processor can 
address an original variable in a second processor, and direct the most current 
value of that variable be stored in a third processor upon the occurrence of a 
condition in a fourth processor. An acknowledgment message can then be 

25 sent to a fifth processor. 



Boyle. U.S. Patent No. 4,646,229, describes a database system that includes 
future versions of the database for use in time-oriented applications such as an 
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application for scheduling the use of the same facilities to present and future 

users. A1J of the information required to represent the date base contents at 

• i " » . 

desired future points in time is maintained in the data base. All transactions 
(e.g., logical units of work) are time stamped to assure access to the proper 
5 version of the database. 1 

Gallant, U.S. Patent No. 4,648,036, refers to a method for controlling 
query and update processing in a database system. Specifically, Gallant 
describes a method for ensuring that a query recedes information 

10 representative of the database either before or after an update, but not 
information representative of a state after the update begins but before it 
completes. Transactional modifications are made to a future database 
structure. At the completion of the modifications, a switch is made from the 
present database structure to the future database structure. A query process 

15 accesses the present database structure. 

Gladney et al., U.S. Patent No. 4,714,992, refers to a method for managing 
obsolescence of replicas of data objects in a distributed processing system. 
Database objects at a source location are replicated at a replica location. As 

20 objects stored at the source location are altered, corresponding objects at the 
replica location become obsolete. A replica location generates a request for a 
list of obsolete objects from the source location. Gladney describes a means for 
identifying the obsolete objects, communicating the identify of the obsolete 
objects, and removing the obsolete objects from the replica location. Gladney 

25 describes a master-slave environment whereby changes to a master are 
propagated to the replicas. 
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Ecklund et al., U.S. Patent No. 4,853,843, describes a multi-version 
database where each update operation creates a new version of the database/ 
and the older versions remain available. Multiple alternative version paths 
are retained. Ecklund describes a method for deriving a minimal <cot of 
5 alternative version paths. When updates are applied to a partition they are 
performed synchronously on multiple sites in the partitions. Change list 
derived from the database histories and virtual partition change histories are 
used to determine the existence of conflicts. Ecklund describes a system for 
merging multiple versions of a data object into a distributed database such that 
10 each updating partition can access its own version. 

Carey etal, U.S. Patent No. 4,875,159, describes a system for 
synchronizing two versions of files in a multiprocessor system. Both versions 
contain a sync-complete control field and a sync-in-progress field. The sync- 
15 complete field indicates that the associated version is synchronized when it is 

set. The sync-in-progress field indicates that the associated version is in the 

» 

process of being synchronized when it is set. If the sync-complete field is 
cleared in one or both of the versions, the sync-in-progress field is set in one of 
the versions. Then, a temporary file is created, and a copy of the one version is 
20 transferred to the temporary filed. The sync-in-progress field of the one 

version is examined after the transfer is complete. If the sync-in-progress field 
is set, the sync-complete field is set in the temporary version. The temporary 
version is renamed to the other of the versions and the original of this version 
is removed. 



25 



Bovkin et al.. EPO 0,428, 264 A2, discloses a method for generating an 
access plan, in a database system, containing low-level code for performing 
preselected constraint checks. Database access commands are compiled into 
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access plans that are executed at runtime instead of the access commands to 
improve system performance. 

Roussopoulos, N. and Kang, H., "Principles and Techniques in the 
5 Design of ADMS+," IEEE. December 1986, pp. 19-25 describes a technique for 
downloading database objects from a mainframe to a workstation as the 
workstation accesses data on the mainframe. Access to the localized database 
subset are performed at the workstation. Database objects accessed by multiple 
workstations are globalized. Updates at the mainframe are logged and 
10 incrementally applied before a query of the involved data is performed. 
Modifications to the downloaded objects are performed by maintaining a 
backlog consisting of entries each of which consists of an operation (i.e., insert 
or delete), tuple-id, and Dl-log (i.e., either a pointer to a file containing deletes, 
or a pointer to a newly inserted tuple). 
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SUMMARY OF THE INVENTION 

The present invention provides the ability to replicate modifications 

(e.g., insert, delete, or update) made at a local site to multiple remote sites in a 

5 peer-to-peer environment. Information regarding these modifications are 
i * 
contained in a set of replication tables. Thus, modifications can be duplitated, 

asynchronously, at other sites immediately after the original transaction's 

modifying commitment, or deferred until the remote site is available. 

♦ 

10 The replication tables of the present invention include a transactions 

table, transaction nodes table, calls table, call nodes table, and an exceptions 
table. The transactions table contains information about a transaction. The 
transaction nodes table contains an entry for each remote site at which a 
transaction is to be applied. The calls table .contains an entry for each 

15 procedure (i.e., transactional or non-transactional series of logical steps) to be 
replicated at remote sites. The call nodes table identifies nodes at which a 
procedure is executed. The exceptions table contains information regarding an 
exception raised when processing a replicated procedure at a data site {e.g., 
conflict or informational message). 

20 

The present invention provides a value-oriented row level and column 
level replication. Row-level replication modifies a remote site based on the 
old and new values contained in a row at an originating site. Column-level 
replication modifies a remote site based on the column values at an 
25 originating site. Further, row-level and column-level replication provides the 
ability to identify conflicting modifications at the remote site. A trigger (i.e., a 
procedure) is executed each time a modification is made to a data item. A 
trigger queues entries in the replication tables. The table entries retain 
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information such that the original modificatioji(s) associated with a data item 
can be propagated to remote copies of the same data items. 1 The new values 
can be compared to the values contained in the remote copies to identify any 
modifications that could be lost, if the current modification is applied. 

5 / • : 

The present invention further provides a logic-oriented procedure-level 
replication. Pjocedure-level replication modifies a remote site based on the 
logical operations used to modify data at the originating site. Procedure-level 
replication provides the ability to identify conflicting updates as well. Thus, 
10 the same logical operations performed on one copy of a data item can be 
propagated to, all other copies of the same data item. 

Information concerning conflicts identified by the present invention can 
be retained in the replication tables. The information contained in the 

15 replication tables can be used immediately, or subsequently, to address any 
conflicts detected by the present invention. Procedure-level replication 
provides the ability to address conflicts, and other exceptions, within the 
procedure replicated to, and executed at, the remote data locations. The 
present invention further provides the ability to rollback any modifications 

20 made when a conflict is identified. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

• • • . • 

Figure 1 illustrates one or more computerized systems at one or more 
locations each containing copies of data items. 
5 1 

Figure 2A illustrates the structure of the data items stored at orte , 
location, 

» 

Figures 2B and 2C illustrate an order transaction and sample data 

♦ 

10 updated by the order transaction. 

Figure 2D illustrates a replication of the transactions processed at 
multiple data locations. 

15 Figure 3 illustrates replication tables. 

Figure 4 illustrates a process flow of a trigger. 

Figure 5A-5C illustrates a process flow for modification operations on 
20 local and remote copies of data items. 

Figures 6 and 7 illustrate a technique for entering replication 
information in the replication tables. 

25 Figures 8A-8B illustrate replication tables including row-level 

replication information. 

Figures 9A-9b illustrate procedures using procedure-level replication. 
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Figure 10 illustrates replication tables including procfedure-level 
replication information. 

5 Figures llA-HB illustrate the execution of transactional and non- 

transactional deferred remote procedure calls. 

Figure 12 illustrates a Exceptions process flow. 



10 



Figure 13 illustrates a ReconstructParms process flow. 
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DETAILED DESCRIPTION OF THE INVENTION 

■■■»',. 

A method and apparatus for data replication is described. In the 
following description, numerous specific details are set forth in order to 
5 provide a more thorough description of the present invention. It will be 

apparent, however, to one skilled in the art, that the present invention may be 
practiced without these specific details. In other instances, well-known 
features have not been described in detail so as not to obscure the invention. 

10 In a networked environment consisting of one or more locations (e.g., 

database servers or computer sites), duplicate copies of the same data may be 
resident at more than one location (e.g., one or more database servers or 
computer systems). Figure 1 prpvides an example of a networked 
environment containing copies data resident at multiple sites. Data site A 100 

15 may be any type of computerized system (eg., networked database server, 
mainframe system, or personal computer system). Similarly, data site B 110 
can be any type of computerized system. Data site A 100 contains database A 
120. Data site A 100 and data site B 110 are interconnected via communication 
link 125. 

20 

Initially, databaseB 130, located at data site B 110, contains a duplicate 
copy of databaseA 120. Thus, a copy of the same data item is available at both 
sites. That is, a user that accesses databaseA can read a data item (e.g., number 
of brooms in inventory) at data site A 100 while another user may access the 
25 number of brooms on hand by reading information resident at data site B 110. 
As long as user A and B access the quantity on hand data item in read-only 
mode, the value of this data item at both locations will remain constant and, 
therefore, consistent. 
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Figure 2A further illustrates databaseA 120. Database A 120 contains two 
relations, or tables. The inventory table 202A contains two fieids: Item 204A 
and QuantityOnHand (qoh) 204B. The item and qoh fields comprise the 
5 information for each inventory item contained in inventory table 202 A. The 
orders table 202B contains order information associated with a customer's 
order (e.g., originating location, customer placing the order, item ordered^ 
quantity of item ordered, and whether the order can be filled by stock qn hand). 
Each table contains entries, or rows. For example, inventory table 202A 
10 contains three entries 206A-206C. ' 4 

Referring to Figure 2B, DatabaseB 130, like DatabaseA, contains an 

♦ 

inventory and orders tables. Further, DatabaseA and DatabaseB contain 
identical entries and values for each entry. Figure 2B further provides a set of 
15 steps that can be used to modify the data contained in either DatabaseA or 
DatabaseB. 

This set of steps, a typical order transaction (i.e., order placement) 
consists of steps for checking the qoh to determine the amount of the ordered 
20 item in stock, updating the qoh in the inventory table (i.e., where qoh > 

quantity ordered), and inserting an entry in the orders table to reflect the order. 
This process is reflected in steps one and two of the basic order transaction 
provided in Figure 2B. 



25 



The final step in the basic order transaction, the commit step is an 
implicit step in transaction processing. It provides the ability to make any 
changes that have been made to the tables permanent. Prior to executing a 
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commit (i.e., makes the table changes permanent), the changes made to the 
inventory and orders tables can be rolled back. • 

» ■ * 

Figure 2B illustrates the initial state of the tables in the (wo databases. 

♦ 

5 However, users A and B can update the tables using the basic order transaction. 
That is, when user A receives an order from a customer <e.g., customer 10 

orders fifty widgets) and invokes the order transaction at location A {i.e.„ 

♦ 

databaseA), the order transaction will update the inventory and orders tables 
in DatabaseA. DatabaseB will remain unchanged. Thus, after user A's order 
10 processing transaction, DatabaseB will no longer be identical to DatabaseA. 

Figure 2C illustrates the resulting databases after both user A and B use 

the order transaction to process an order received at their respective sites. For 

* 

example, user A receives an order for fifty \ridgets from customer 10. User A 
15 invokes the order transaction. The order transaction updates the inventory 
table (i.e., the qoh field associated with the widget inventory item) and the 
orders table (i.e., adds an entry to reflect the order) in database A. Similarly, 
user B invokes the order transaction to process the order received from 
customer 11 at site B (i.e., for forty widgets). The transaction invoked by B 
20 updates the inventory table and orders table in database B. 

Both orders consisted of an order for widgets. Customer 10's order was 
for fifty widgets, and Customer ll*s order was for forty widgets. Therefore, a 
total of ninety widgets were sold. However, database A does not reflect 
25 customer ll's order, and database B does not reflect customer 10's order. Thus, 
each database only reflects the orders processed by one of the users (i.e., A or B), 
and will therefore not reflect all of the orders that have been processed at all of 
the sites. 
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Thus, there is a need to propagate local modification to all remote copies 
of the same data item. The present invention provides this ability to replicate 
the data modifications made at one location to other locations. Thus, the 
5 databases illustrated in Figure 2C are be replicated to other sites such that order 
transaction modifications at one site are be applied to other sites, and the data 
items are again consistent. 

Figure 2D illustrates the state of the two databases before and after the , 
10 replication capabilities of the present invention are used. Before replication, 
the two databases reflect only the order transactions processed locally (i.e., 
database A reflects user A's order transaction^ and database B reflects user B's 
order transaction), and not the order transactions processed on a reimote 
database (e.g., .database B is a remote database to user A). 

15 

After the DbB=>DbA and DbA=>DbB replications, user A's data 
modifications are reflected at user A's local database (i.e., database A) and at 
database B (i.e., remote site). Similarly, user B's data modifications are 
reflected at both the local and remote sites. The widget's qoh value in both of 
20 the databases reflects the overall decrease in the qoh associated with both of the 
orders. Further, the orders tables in both databases reflect both of the orders 
received. 

The present invention provides the ability to replicate data at the row 
25 level and at the procedure level. Row level replication (i.e., value-oriented 

replication) is accomplished by associating a trigger with a table (e.g., inventory 
table). A trigger is a procedure that is executed when a modification (e.g., 
update, insert or delete) occurs to a row in a table. A trigger identifies a 
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deferred remote procedure call (DRPC) that has as its arguments the old 
values, new values, and the operation (e.g.-, insert, delete, or update). 

Procedure-level replication propagates the operation gather than the 
5 row, values (i.e., logic-oriented). After a procedure is executed at the 

originating site, the procedure defers a call to itself at another site. The DRPC 
will apply the logical update of the original procedure at a remote site. 
Procedure-level replication requires less network traffic than row-level 
replication since one DRPC can be used for multiple tables. 

i 

10 

Modification. Identification and Retention 

To propagate the modifications made to data items in one database to 
the same data items in another database, it is necessary to retain the 

15 modifications until they can be made to the other sites. PrioT art methods use 
a transactional recovery log (i.e., redo log) to retain and identify the database 
modifications for propagation. However, the redo log was originally intended 
for the application of transactions in a single database system, and contains 
"undo" information (i.e., information that can be used to roll back changes 

20 made to data after an event, such as a system failure, occurs). 

For example, a redo log can be used on a single copy of the data to undo 
changes made to a database by one or more transactions when a system or 
application error is encountered. When such an error is encountered, the 
25 transactional updates made prior to the error (and related in some way to the 
error), must be undone to maintain the data integrity in existence prior to the 
updates. However, the redo log was not designed for capturing modification 
information for propagation to a second database system. Thus, when the log 
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is used for its intended purpose as well as a resource for data replication, a ' 
storage management problem arises, because a redo log that it used to retain 
propagation information can never be moved off-line (i.e., made inaccessible • 
to the database system). 

5 

Unlike the prior art systems, the present invention provides a 
propagation identification capability that can.be managed in the same way as 
any other table, or relation, managed by a database administrator The present 
invention provides the ability to encode the propagation information in tables 
10 in the database system. The information stored in these tables can be retrieved 
as any other data within the database system, and can be accessed at any time. 

The tables include the information necessary to replicate a data 
modification to other data sites. For example, the tables contain information 

15 relative to a DRPC , its replication destination, the transaction the DkPC is a 
part of, the order in which a DRPC is executed within a deferring transaction, 
the order in which transactions are executed relative to other transactions, and 
the arguments used by each DRPC. The present invention uses the following 
tables: Transactions, Transaction Nodes, Calls, Call-nodes, and Exceptions. 

20 Figure 3 illustrates a composition of these tables. Because of the versatility of 
the present invention (e.g., replication information is stored in relations), 
additional information can be added to these relations. 



25 



— Transactions Table — 

The Transaction Table contains information about transactions that are 
performed on the data and that use deferred remote procedure calls (i.e., 
DRPCs) in some manner. The transactions table consists of the following 
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fields: transaction identifier, delivery order number (DON), start time, 
deferring user identifier, and destination-list. The Transaction identifier 
("Transactionjd") is a unique identifier that is assigned to each transaction. 
The Transaction_Id further uniquely identifies the origin database for deferred 
5 transactions. Transaction _Id is the primary key for the Transaction table (i.e., 
the value that uniquely identifies an entry in this relation). The destination- 
list controls whether destinations for a transaction are described by the call- 

* 

nodes table or external routing tables. 

1 • 

♦ 

10 The DON is an abstraction of a system change number. A DON reflects 

the commit sequence of a transaction relative to all of the other transactions 
listed in the transaction table. The DON respects the partial order of the 
commit of the transactions in the deferring database. Thus, if transaction one 
Tl touches the same data as transaction two T2 and T2 commits before Tl, the 

15 DON of transaction two (D2) is less than Tl's DON (Dl). 

The time field reflects the time that the transaction was started, and the 
deferring user field identifies the user who initiated the deferred transaction's 
procedure call. This information can be used to monitor and control access 
20 (i.e., for security reasons). For example, prior to making any modifications to 
data at a remote site, a check can be made to determine whether the deferring 
user has the access privileges that would enable the modification at the remote 
site. 

25 The replication capability provided by the present provides the ability to 

modify all or some portion of the data modified by a transaction by site 
specification. That is, a deferred transaction can consist of a subset of the calls 
deferred by the original transaction for any given destination site. For 
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example, an order transaction at site A can update an orders and inventory 
tables, and the replicated transaction at site B can update the orders table only. 

i 

A call nodes table can be used to define a transaction's calls that are 
5 applied at a given site. In addition, any routing mechanism can be used. For 
example, a mapping of calls to destinations can be defined according to the 
• name of the procedures beings deferred. The destination list field of the 
transactions tables indicates the mapping mechanism (e.g., call nodes table or 
other routing mechanism) used. 

10 

— Transaction Nodes Table — 

The Transaction Nodes Tablie (i.e., destinations table) identifies the nodes 
or remote si-test' at which the transactions contained in the transactions table are 
15 to be executed. The transaction node table contains one entry for each node 
(i.e., remote site) at which a transaction is to be executed. 

The transaction identifier ("Transaction_Id") has the same definition as 

the same field in the transactions table. The Destination Node ("dest^node") 
20 identifies the nodes (i.e., remote databases) at which the transaction is to be 

executed to replicate the changes made by the transaction on the local data site. 

Thus, the same transaction_id can be used to access an entry in the transaction 

table, and any corresponding entry or entries in the transaction node table. 

Further, an entry in the transaction nodes table identifies one of the remote 
25 sites to which the transaction's modification are to be propagated. A 

transaction_id and dest_node combination can uniquely identifies an entry in 

the transaction nodes table. 
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~ Calls Table - 

As illustrated in the order processing example provided earlier, 
transactions (i.e., a logical unit of work) can be composed of ?teps, or 

5 procedures (e.g., place order and inventory check). In software encoding these 
steps, each can be considered a separate procedure to provide additional 
structure to an application. Further, procedures can be defined without being a 
part of a transaction. Information regarding either type of procedure is 
retained in the calls table. The calls table contains a unique identifier, Call 

10 Identifier ("calMd"), that can order a call within a transaction, or orders non- 
transactional DKPCs relative to all others. 

Like the transactions and, transaction nodes tables, the calls table 
contains a transaction_id. For transactional DRPCs, the transaction-id has the 
15 same definition as the transaction_id field in the transactions and transaction 
nodes tables. For non- transactional DRPCs, the transactionjd field is not 
used. 

The Deferred Procedure Identifier Cproc_id") identifies the procedure, 
20 or call, (i.e., series of program steps) that is to be executed at the remote site. 
For example, the proc_id may be a character string containing the procedure's 
name. It could also be a system-provided, unique identifier that includes the 
location (e.g., storage address) of the procedure. The parameter count identifies 
the number of parameters (values passed into the procedure for use during the 
25 execution of the procedure) for a procedure. 
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The parameters field (parms) is a long raw byte string containing the 
parameters for the entry in the calls table. The format of the field is as follows: 

5 <tci><leni><valuei><tc2><len2><value2> - <tCn><>enA>£value n ><0> 

Where: 

<tci> is the parameter type code for the ith parameter (i.e., whether the 
parameter is of type number, character, date, rowid, or null); , ' 

<leni> is the two byte binary integer value of th$ length of valuei (length 
0 of zero indicates a null parameter value); ♦ 
<valuei> is the parameter value; 

<0> is a single byte value indicating the end of the string; 

— Call-Nodes Table - 

15 • 

The call-nodes table contains a row for every destination of each 
deferred call when the destination is not defined by an external routing table. 
The call-nodes facilitates the ability to replicate a transaction comprised of 
multiple procedure calls by specifying the execution of some or all of the calls 

20 at a given site. When a call's destinations are not defined by an external 

routing structure, call nodes are specified by the deferring user, either with the 
deferred call, as transaction default destinations, or as system determined 
destinations. When a deferred transaction is being sent to a destination, the 
call-nodes table is queried to select those calls that are to be executed as part of 

25 the deferred transaction at the destination. 
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The Transaction Identifier and the call identifier are the same as those 
in the calls table. The destination node field identifies the node at which the 
execution of a procedure is deferred. 

5 — Parameters Table — 

In an alternate embodiment, a table can be used to retain the parameters 
for a call instead of storing the parameters in the calls table. In this 
embodiment, the parameters table contains an entry for each parameter used 
10 by an entry in the calls table. That is, for each call containing parameters, there 
is one or more entries in the parameters table. Each entry in the parameters 
table contains a parameter for an entry in the calls table. 

The parameters table contains a Call Identifier ("calMd"). Like the calls 
15 table, the call-id identifies a procedure call. A procedure call with more than 
one parameter contains an ordered list of parameters. The Parameter Number 
("param^no") can, therefore, identify an entry's location within a procedure 
call's ordered list of parameters. A call_id and param_no pair can uniquely 
identify an entry in the parameters table. The type field contains a code that 
20 indicates the type of parameter. That is, the type field indicates whether the 
parameter table entry is a number, character, date, rowid. 

Only one of the remaining fields (i.e., Number, Character, Data, Rowid) 
is used for each entry in the table. That is, if the parameter is a number, the 
25 value of the parameter is stored in the number field. Similarly, if the . 
parameter is of type character, the value of the parameter is stored in the 
character field. A date parameter value is stored in the date field. Rowid 
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information (i.e., identifier specifying a row within a table) is stored in the 
rowid field. 

— Exceptions Table — 

'5 • ' • '* , . 

The exceptions table is used to store information related to* any 
exceptional or occurrence during executions of a deferred transaction. This 
information can be subsequently reviewed, and the exceptional occurrenfce can 
be addressed. For example, multiple conflicting updates may occur to different 
10 copies of replicated data. Thus, one transaction, Tl, can update one copy of 
record A, CI, and a second transaction, T2, can update a second copy of record 
A, C2. If Tl is propagated to C2, Tl can overwrite T2's update, and vice versa. 
The present invention detects this type of exception, and others, and retains 
information for each exception. , 

15 

The exceptions table contains a Transaction Identifier ("transaction_id M ) 
field that has the same definition as the transaction_id field, in the 
transactions, transaction nodes, and calls tables. The call identifier has the 
same definition as the call_id field in the calls table. The Destination Node 

20 ("dest_node") identifies the node at which the exception occurred. In the 

previous paragraph's example, the node field would contain the identification 
of the node that stores C2. The error code field (error_code) contains a code to 
identify the error, or exception, encountered (e.g., overwrite potential for T2's 
update of C2). Further, the error string field contains an additional description 

25 of the error. A trans action_id, and dest_node combination can uniquely 
identify an entry in this table. 
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Populatin£ Replication Tables 

The modification replication provided by the present invention is 

asynchronous. That is, the replication procedures that modify 'the remote data 

5 copies (e.g., <table_name>_insert) do not execute as part of the modification 

operation that is performed at the local, originating site. Rather, a 

modification to the remote data copies can be deferred until the remote copies 

♦ 

are available. 

1 • 

♦ 

10 The process of deferring the modification operation at the remote site is 

accomplished, in the present invention, by storing the information for each 
deferred modification operation in the replication tables, and subsequently 
performing the modification operation identified in the replication tables at 
the remote sites. . 

15 

Referring to DatabaseA (DbA) in Figure 2C, for example, the 
modifications to the Inventory and Orders tables can be replicated to DatabaseB 
(DbB) in Figure 2C by replicating DbA's modification to DbB's Inventory and 
Orders tables. Thus, the update performed on DbA's inventory table and the 
20 entry inserted in DbA's orders table can be replicated at the DbB site by 

replicating the Basic Order Transaction's modifications performed on the data 
in DbA. 



The basic order transaction is replicated at DbB by queuing the 
25 transaction in the replication tables, and subsequently applying the 

modifications contained in the transaction on the data in DbB according to the 
information contained in the replication tables. Figure 6 illustrates a 
processing flow for queuing the transaction, transaction nodes, and <all 



WO 95/08809 PCT/US94/10093 

-24- 

destination tables for a transaction, a transaction's destination nodes, and any 
transactional or non-transactional call. Figure 6 can be invoked multiple times 
to queue any number of calls. For example, the invoking routine can contain a 
looping mechanism that can invoke queue_transactional_DRPC for each 
5 transactional or non-transactional call. 

At decision block 601 (i.e., "first call?", if this is not the first call to queue 
this transaction, processing continues at block 616. If it is the first call, 
processing continues at block 602. At processing block 602, a current 
10 transaction identifier is assigned to the transaction to be stored in the tables 
(e.g., DbA's basic order transaction). The current transaction identifier is 
assigned a value that will uniquely identify the table entry. At processing block 
604, an entry is inserted in the transactions table. The transaction identifier 
field is assigned the value of the current transaction identifier. 

15 

The originating transaction (e.g., DbA's order transaction) is assigned a 
DON when the commit step in the order transaction is successfully performed. 
The DON provides an ability to order transactions based on the order in which 
they have modified the data. Thus, where the order of the modifications is 
20 crucial, the DON can be used to retain the modification order and thereby 
maintain data integrity. The DON field is assigned the value of the original 
transaction's DON. The time field of the new transactions table entry is set to 
the current time. The deferring user identifier is assigned the value of the user 
that originated the original transaction. 

25 

A destination (i.e., remote data copy identifier) is identified at processing 
block 606. At processing block 608, an entry in the transaction nodes table is 
created to indicate that the transaction currently being entered into the 
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replication table is to be performed at the identified distillation. Thus, the 
transaction identifier is assigned the sanfie value as the same field in the 
transactions table, and the destination node is set to the identified destination. 

5 . An entry should be made for each destination identified. Thus, at 

decision block 610 (i.e., "other remote destinations?"), if additional 
destinations exist, processing continues at processing block 606 to identify the 
next destination. Further, a transactions node table entry is created for each 
such destination. When it is determined, at decision block 610 (i.e., "other 
10 remote destinations?"), that all of the remote destinations have been entered 
into the transactions node table, processing continues at decision block 616 (i.e. 
"all calls in transaction processed?") 

»■ * 
As illustrated in the basic order transaction provided in Figure 2B, a 

15 transaction can be comprised of multiple steps. In the basic Order transaction, 
the steps where: check inventory and process order. These steps can be 
designed and encoded in software as separate procedures. In this case, the basic 
order transaction can contain an inventory_check procedure that exercises the 
steps contained in an inventory check. Similarly, the basic order transaction 

20 can contain the place__order and commit procedures. Each of these calls that 
comprise the transaction can then be entered into the calls table of the present 
invention. 



At processing block 616, a unique call identifier is generated and 
25 assigned to current_call_id. At processing block 618, Queue_call_ar£S is 
invoked to enter the call information into the replication tables. After the 
appropriate call information has been added to the replication tables, 
processing returns at block 620. 
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Figure 7 illustrates a processing flow to queue call information. At 
processing block 700, an entry is created in the calls table for the procedure call 
currently being processed- The entry's call identifier field is set to the 
5 current_calMd. The same value assigned to the transaction idehtifier fields in 
the transactions and transaction nodes tables is used for the transaction 
identifier field in the calls table. The procedure identifier field is assigned a 
value that can uniquely identify the procedure being deferred. This value can 
be anything that identifies the procedure such as a procedure name or the 
10 storage location of the procedure. , t 

During its execution, a procedure can use values (parameters) that are 
externally-generated (i.e., defined outside the procedure) and passed into the 
procedure, or internally-generated. In the preferred embodiment, the 
15 parameters are queued in the parms field of the calls table. 

At processing block 702, parm_count is initialized to zero. At decision 
block 704 (i.e., "all parameters in call processed?"), if all of a procedure's 
parameters have been added to the parameters table, or the procedure does not 
20 have any associated parameters, processing returns at processing block 714. At 
block 714, the value of parm_count is used to update the parameter count field 
of the call's entry in the calls table. 

At decision block 716 (i.e., "call's destination nodes not defined by 
25 external routing mechanism and destination nodes defined by user with 

DRPC?"), if the call's destination nodes (i.e., execution nodes) are defined by an 
external routing mechanism, processing returns at block 724. If the call's 
execution nodes are defined by the user with the DRPC and not by an external 
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routing iriechanism, processing continues at decision block 718, At decision 
block 718 (i.e., "all execution nodes processed?"), if all of the execution nddes' 
have been entered in the call nodes table, processing returns at block 724. 

5 If all of the execution nodes have not been processed, processing 

continues at block 720 to get the next execution node specified by the use*. At 
processing block 722, an entry is created in the call nodes table for the current 
execution node. Processing continues at decision block 718 to process any 
remaining execution nodes. ■ * 

10 . ' ' 

If, at decision block 704 (i.e., "all parameters in call processed?")/ 

# 

parameters remain to be processed, processing continues at processing block 
710. At processing block 710, the parameter's type (i.e., data type), length, and 
value are appended to any existing value in the parms ftekl of the current calls 
15 table entry. At block 712, parm_count is incremented by one. Processing 
continues at decision block 704 to process any remaining parameters. 

In an alternate embodiment, a call's parameters ran be stored in a 
separate table, a parameters table (see Figure 3). In the alternate embodiment, 

20 block 710 creates a separate entry in a parameters table for each parameter. 

Thus, a parameters table contains an entry for each parameter that is associated 
with a procedure. Each entry contains the call identifier, a parameter number 
(i.e., the number of the parameter in relation to the other parameters in the 
call), and the data type of the parameter. The value of the parameter is stored 

25 in one of the value fields (i.e., number, character, date, or rowid) based on the 
data type. For example, if the inventory_check procedure in the basic order 
transaction contained three parameters, three entries would be added to the 
parameters table. 
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i 

Triggers , • 

• I 

i 

Triggers provide one alternative to initiate the population of the 
5 replication tables '. A trigger is a procedure that is executed when any 

modification (e.g., update, insert or delete) is performed on an entry in a table 
at the local site. Figure 4 provides an example of the process flow of a trigger in 
the present invention. • 

10 Decision block 402 (i.e., "is this trigger firing as a result of a replicated 

modification?".) illustrates an issue addressed by a trigger in the present 
invention. Because a trigger is initiated when any modification operation is 
performed on a table, an operation that is performed on a remote data entry 
(i.e., a modification operation) will result in the initiation of a second trigger. 

Unless a trigger contains some mechanism for differentiating between a 
modification done as a result of an original modification operation and a 
modification that is a replicated modification (i.e., the result of an earlier-fired 
trigger), the replicated modification will itself generate a trigger, and the 
20 original modification operation could be replicated multiple times at a data site 
unnecessarily. This would endanger the integrity of the data at the local and 
remote sites. 

Thus, decision block 402 (i.e., "is this trigger a result of an earlier-fired 
25 trigger?") determines whether the table modification that generated the trigger 
is a consequence of an original or replication modification. If it is an original 
modification, the trigger is not the result of an earlier-fired trigger. However if 
the modification is the result of an original modification that has been 
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replicated to a remote table, a second trigger should not be .queued in the 
replication tables. 1 ' 

Therefore, if, at decision block 402 (i T e., "is this trigger 4 result of an 

1 

5 earlier-fired trigger?"), a trigger is the result of replicated modification, a 

processing ends at block 406, and the modification procedure is not queued. If, 
at decision block 402, the trigger was generated as a result of an original 
modification, processing continues at processing block 404. At processing blocjc 
404, a modification entry is inserted in the replication tables. Processing then 
10 ends at block 406. 

The mechanism for identifying duplicative replications can be 
implemented iij various ways in the present invention. In the preferred 
embodiment, a duplicative replication can also be detected by netting a global 

15 variable (i.e., a variable that can be accessed by any trigger or replication 

procedure) before a replicated modification is performed. When the replicated 
modification is performed, the trigger can (at decision block 402 in Figure 4) 
check the global variable to determine whether the modification is the result 
of a replication procedure or an original modification. This alternative of 

20 setting the global variable in the replication procedure is further illustrated in 
connection with row-level replication discussed below. 

In an alternate embodiment, a duplicative replication can be detected by 
associating data modifications to a user. Thus, it is possible to identify an 
25 original modification by its user, and to identify a replicated modification by 
another user (i.e., modification procedures can be executed by a distinguished 
user). Thus, decision block 402 in Figure 4 can check for the user name that 
invoked the modification. If the user is a distinguished user, the trigger was 
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re- 
generated as a result of an earlier-fired trigger. If the user is not a distinguished 
user, the trigger was generated as a result of an original modification. 

These alternatives, provide examples of techniques for detecting 
5 duplicative modifications. Any means can be used to detect duplicative 
modifications without departing from the scope of the present invention. 

t 

, t 

Row-Level Replicatio n Procedures 

10 Row-level replication is a feature of the present invention that uses 

triggers to replicate row-level value-oriented modifications. That is, row-level 
replication provides an ability to replicate changes made to the valu.es in a row. 
Row-level replication associates a trigger with a table (e.g., DbA's inventory in 
Figure 2A) such that any changes made to one or more values in a local table 

15 entry (e.g., qoh field in DbA's inventory table of Figure 2A) will trigger a like 
change to remote copies of the changed values (e.g., qoh field in DbB's 
inventory table of Figure 2A). 

A trigger causes the information associated with a procedure used to 
20 replicate the value changes in the local copy to a remote copy to be stored in 
the replication tables. The procedure, a deferred remote procedure call (DRPC), 
can be subsequently executed at remote sites to replicate the data 
modification(s) performed on local data. The name of the DRPC corresponds 
to the table being modified and the operation being performed on the local 
25 data (e.g., <table_name>_update). The DRPC has as its arguments (i.e., 

parameters), generally, the old values of the local data and the new values of 
the local data. The old values, or a subset thereof, uniquely identify the row 
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that is the target of the modification. The use of these arguments is specific to 
the operation that is performed at the local site. 

For example, if an update operation is performed on the local data, the 
5 old values would be used to detect conflicts. That is, a difference between the 
old values of the local data and the current values at the remote site may 
indicate that a separate operation has been performed on the remote data that 
may get erased with the current update. The new values can be used to 'update 
the remote data. 



10 



An insert operation (i.e., insert a row of values or a field value) there are 
no old values, and, therefore, no old values are included in the call. Further, 
there is no need to use old values to perform a check for conflicts (i.e., . 
exceptions) at the row-level when inserting a new row or field value. The new 
15 values, or a subset thereof, uniquely identify a new remote table entry. 

If the operation is a delete (i.e., delete a row or field), there are no new 
values. However, like an update operation, the old values can be used to 
detect potential conflicts. Further, the old values, or a subset thereof, can be 
20 used to uniquely identify the remote table entry to be deleted. 

A DRPC name can incorporate the name of the table to be modified and 
the operation to be performed. Referring to Figure 2A, for example, the 
triggers for the inventory table can have the names: inventory_insert, 
25 inventory_update, and inventory_delete). This naming convention assists in 
identifying the table and operation involved in the replication process. 
However, any naming convention can be used with the present invention. 
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— Row-level insert — 

Figures 5A-5C provide an example 6f the process flow for the 
<table_name>_insert, <table_name>_delete, and <table_name>_update 
5 DRPCs. These DRPCs execute at sites remo,te from the original modification. 
Therefore, when they refer to "remote tables," they are referring to tables local 
to the DRPCs execution and remote from the original modification. Figure 
4A provides an example of the processing flow of a <table_name>_insert 
DRPC. As discussed previously, a global variable can be used as one 
10 alternative for identifying replicated modifications. Such a global variable is 
set at processing block 502. At processing block 504, a roW is inserted iri 
<table_name> using the value(s) provided in the new_values parameter(s). 

To further illustrate the need to test for duplicative modifications, the 
15 replicated insert operation performed by the process of Figure 5A would 
generate a trigger (i.e., any table modification initiates a trigger). Thus, the 
trigger process of Figure 4 is invoked when the insert operation of processing 
block 504 (Figure 5 A) is performed. Because the g^bal variable was set to 
indicate a duplicative modification (at processing block 502 of Figure 5A), the 
20 trigger can determine (at processing block 402 of Figure 4) that the modification 
is a replicated modification, and a DRPC will not be queued for the replicated 
modification (i.e., at processing block 404 in Figure 4). 



25 



Continuing with the process flow of Figure 5A, after the insert operation 
is performed on the remote table, the global replication variable is reset at 
processing block 506. Processing ends at block 508. 



WO 95/08809 PCT/US94/10093 ' 

-33- 

— Row-leVel update — 

• ♦ 

Figure 5B provides an example of the processing flow of an update 
DRPC (e.g., <table_name>update). A global replication variable.is set at 
5 processing block 522. At processing block 524, the remote table entry is 

identified using the old values, or a subset of the old values. At decision block 
526 (i.e., "row found at remote site?"), if the remote table entry cannot be, 
found, processing continues at processing block 532 to log an exception in the 
replication tables. Further, the global replication variable is reset at processing 
10 block 548, and processing ends at block 550. If the remote table entry is found, 
processing continues at decision block 530. 

Before an update is made to a remote site, a check can be made to 
determine whether a modification has been .made to the remote data that is 

15 independent of the current update operation that might be erased if the 
current update is performed on the remote data. This might occur, for 
example, when a modification (other than the current, replicated operation) 
could have originated at the remote site, the replication of which has not 
reached the site that invoked the current replicated update. If the current 

20 replicated update overwrites the remote table entry's values with the new- 
value parameters, the remote table entry's current values will be lost, and the 
remote table's original modification will, therefore, be lost. 

Alternatively, in some applications, concurrent modifications to 
25 disjoint sets of non-primary field values can be permitted. For example, a 
modification to a customer's balance need not conflict with a change to a 
customer's address. If updates can be applied to non-primary fields on a field- 
by-field basis, concurrent updates are not lost. At decision block 530 (i.^., "type 
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of lost update prevention?* 1 ) determines whether the lost update is a row level 
update or £ column level update. If it is a row level update; processing 
continues at decision block 536. If it is a column level update processing 
continues at decision block 534. 

5 ' / . • ; 

At decision block 534 (i.e., "is each fiqld value equal to its corresponding 
old_value parameter where the corresponding old value parameter is not 
equal to the corresponding new value parameter?), if the old values are equal 
to their corresponding old_value parameter where the old value parameter is 

10 not equal to the new value parameter, processing continues at processing block 
544 to update the fields that have been changed, and processing continues at 
block 548. If not, processing continues at decision block 538 (i.e., "should lost 
updates be prevented?"), if lost updates should be prevented, processing 
continues at block 540 to invoke Exceptions, and processing continues at block 

15 548. 

At decision block 536 (i.e., "is each field value equal in the row equal to 
its corresponding old_value parameter?), if the old values are equal to their 
corresponding old_value parameter, processing continues at processing block 
20 546 to update each field in the row with its corresponding new_value 

parameter, and processing continues at block 548. If not, processing continues 
at decision block 542 (i.e., "should lost updates be prevented?"), if lost updates 
should be prevented, processing continues at block 540 to invoke Exceptions, 
and processing continues at block 548. 

25 

At processing block the global replication variable is reset. Processing 
ends at block 550. 
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— Row-level delete — 1 » 

Figure 5C provides an example of the processing flow pf a delete DRPC 
5 (e.g., <table_name>delete). A global replication variable is set at processing 
block 562. At processing block 564, the remote table entry is identified using 
the old values, or a subset of the old values. At decision block 566 (i.e., "row 
found at remote site?"), if the remote table entry cannot be found, processing 
continues at processing block 572 to log an exception in the replication tables. 
10 Further, the global replication variable is reset at processing block 576, and 
processing ends at block 578. If the remote table entry is found, processing 
continues at decision block 528. 

As in the update DRPC process, a check is made to determine if a check 
15 should be made for lost updates (i.e., modifications). Thus, decision block 568 
(i.e., "should lost updates be prevented?") determines whether to test for 
potential lost updates (i.e., lost updates). If not, processing continues at 
processing block 574, and the remote table entry is deleted from the remote 
table. After the delete operation is performed on the remote table entry, the 
20 global replication variable is reset at processing block 576, and processing ends 
at block 578. 

If, at decision block 568 (i.e., "should lost updates be prevented?"), 
existing modifications should be preserved, processing continues at decision 
25 block 570. At decision block 530 (i.e., "is each field value in the row equal to its 
corresponding old_value parameter?"), if any of the remote table entry's field 
values do not equal its corresponding old_value parameter, processing 
continues at processing block 572. At processing block 572, an exception is 
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logged in the replication tables. Processing continues at block 576 where the 
global replication variable is reset, and processing ends at block 578. • 

If, at decision block 570 (i.e., "is each field value in the row equal to its 
corresponding old_value parameter?"), all of the field values 'in th? remote 
table entry are equal to their corresponding old_value parameters , processing 
continues at processing block 574. At processing block 574, the remote table 
entry is deleted from the remote table. After the delete operation is performed 
on the remote table, the global replication variable is reset at processing block 
576, and processing ends at block 578. ' , 

Row-level Replication Example 

» 

The replication illustrated in Figure 2C (i.e., DbB=>DbA and DbA=>DbB) 
can be accomplished using triggers and row-level replication. The order for 
fifty widgets at the database A location resulted in the invocation of a basic 
order transaction to update the inventory table and place an order in the orders 
table. Either of these modifications will cause a trigger associated with either 
table to execute. 



For example, when the qoh field of DbA's inventory table is updated by 
subtracting the quantity ordered, a trigger associated with the inventory table 
(and illustrated in Figure 4) invokes the procedures of Figures 6 and 7 to 
populate the replication tables with a DRPC. In this case, an Inventory _update 
DRPC similar to the <table_name>_update DRPC illustrated in Figure 5B can 
be used to replicate the changes in DbA's inventory table to DbB. 
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Referring to Figure 6, if this is the first call, a transactional identifier is 
generated for the DRPC (i.e., ,<table name>_update) at processing block 602. At 
processing block 604, an entry is inserted into the transactions table as 
. illustrated in Figure 8 A. A transaction identifier (e.g., 1), DON;£me, and 

5 deferring user identifier are assigned to the inventory_update transaction. 

♦ 

At blocks 606 through 610, the transaction nodes table is populated. In 
this case, the only remote copy of the data is located at DbB. Therefore, one 
entry is inserted in the transaction nodes table where the transaction identifier 
10 is the same as the same field in the transactions table, and the destination node 
is set to DbB. 

An entry is created in the calls table for each call in the transaction. 
Referring to Figure 7, an entry is inserted into the calls table to reflect the 
15 inventory_update DRPC. The call identifier is a unique identifier for the 

inventory_update DRPC. The transaction identifier has the same value as the 

4 

same field in the transactions and transaction nodes tables. The deferred 
procedure identifier can be any value that identifies the DRPC. In this case, the 
name of the DRPC is used. 

20 

If an external routing mechanism is not being used, the call nodes table 
can be populated with the entries that identify the user specified destination 
nodes at which execution of the DRPC is deferred (i.e., execution nodes). 
Referring to 8A, the destination list field indicates that the destination nodes 
25 are not specified by an external routing mechanism. In this case, the 

destination nodes for the transaction's one DRPC (i.e., inventory_update) can 
be determined by the entry in the transaction nodes table. However, the call 
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nodes table can be used to identify DbB as the destination node for execution of 
the inventory_update DRPC. 

i i 

i 

Any parameters associated with a call are stored in the call's entry in the 
5 calls table. Referring to Figure 5B, the update procedure uses the widget's old 
inventory values and new inventory values. The widget entry in the 
inventory table is uniquely identified by the old value "widget.". Because each 
entry in the table contains two fields, there will be two old values and two new 
values. Therefore, there are four arguments associated with the 
10 inventory_update procedure. The arguments (including an example of their 
attributes) are as follows: 

Old values New values 

Type Length Value Type Length Value 

2 ,. 06 Widget 2 06 Widget 

15 1 03 400 1 03 350 

The parms field in the calls table entry associated with this call contains 
a string of parameter information. A terminating value (e.g., "0") is placed at 
the end of the string. The resulting string is: 
20 "206Widgetl03400206Widgetl033500". 

If inventory_update is considered to be a non-transactional DRPC, the 
process of inserting entries in the transactions and transaction nodes tables 
could be bypassed. To associate the non-transactional DRPC with the 
25 destinations at which the DRPC is to be executed, the call nodes table can be 
used. An entry can be placed in the call nodes table for each location at which 
the call is to be executed. 
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Figure 8B illustrates the state of the replication tables after their 
population with a non-transactional DRPO The transactions and transaction 
nodes tables are not used- The calls and call nodes tables are the same as in 
Figure 8 A with the exception of the transaption identifier fiel4- Since there is 
5 no associated transaction, there is no entry in the transaction and transaction 
nodes tables, and no transaction identifier value- 

Column-Level Replication 

10 Column-level replication is a variation of lost update prevention -for 

row-level replication. Column-level replication applies only to update 
operations. In column-level replication, concurrent updates can be made to 
disjoint sets of non-primary key .columns. Lost updates are prevented only 
when the updates are to columns whose values have been changed (i.e., those 

15 columns changed at the originating site as indicated by a difference between 
the old_value and new_value parameters). 

Column-level replication uses the same <table name>_update 
procedure as row-level replication. Figure 5B illustrates the differences in the 

20 logic for detecting lost updates and applying updates for row-level and 
column-level replication schemes. A row-level replication scheme 
determines that a remote site's current values match old values for all 
columns in a table entry (at decision block 536) prior to applying an update at a 
remote site. Column-level replication checks ( at decision block 534) only 

25 those columns that were changed by the original update (as indicated by a 

difference between the values of the corresponding old and new parameters). 
If the oldjvalue parameters of the changed columns are «qual to their 
corresponding values at the remote site, the loss of an intermediate update is 
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unlikely, and the update operation can be completed. If the old_value 
parameters of the changed columns are not the same as their corresponding 
values and lost updates are to be prevented (at decision block 538), an exception 
is raised (processing block 540), and the update operation is not performed. If 
5 lost updates are not to be prevented <at decision block 538), thq update 
operation can be performed. 

A row-level update operation involves all of the columns in a table 
entry (at processing block 546). However, a column-level update operation 
10 involves only those columns changed by the original update (as indicated by a 
difference between the values of the corresponding old and new parameters). 
Thus, in a column-level update operation, only those columns changed by the 
original update are updated with their corresponding value in the newjvalue 
parameters (processing block 544). 

Procedure-Level Replication 

Procedure-level replication provides another alternative for replicating 
data modifications. As previously discussed, row-level replication is value 

20 oriented. That is, the values that are the result of some operation are 

replicated to remote copies. In contrast, procedure-level replication provides 
the ability to replicate the logical operation at remote sites. That is, the 
procedure that modified the local copy can be replicated at the remote sites. 
Thus, after the execution of a procedure at the originating site, the procedure 

25 creates a DRPC such that the procedure defers itself to another site to apply the 
logical update on a remote copy. 
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Protedure-level replication provides additional flexibility in that an 
application (e.g., order processing example in Figures 2A-2C) -tan determine ' 
how a replication is to be propagated, and how replication conflicts (e.g., 
multiple conflicting updates to the same dafa item),are to be addressed. That 

5 is, a DRPC can designate its propagation and the process to be invoked when 

» ■ 
replication conflicts are identified. 

Figures 9A-9B provide an example of an application that uses 
procedure-level replication to replicate its logical operations to other sites.^ The 
10 application assumes the relations (i.e., tables) described in Figures 2A-2D. 
Further, any changes made to the tables at the DbA site must be replicated at 
the DbB site, and vice versa. 

In the basic order transaction illustrated in Figures 2B-2C, once a 
15 customer's order is received, the inventory table is updated to reflect a decrease 
in inventory by the quantity ordered, and an entry is inserted in the orders 
table to retain information about the order. If the order is received and 
processed at DbA, the same order processing is replicated at DbB as illustrated 
in Figure 2D. 

20 

Figure 9A illustrates the original order processing that is performed 
when an order is received at one of the sites (e.g., DbA). To summarize the 
order processing at DbA, at processing block 902, the item ordered is found in 
the inventory table, and the inventory amount is stored in qoh. If the entry is 
25 not found, an exception is raised at block 906. If the inventory amount {i.e., 
qoh) is greater than the quantity ordered, the inventory amount is decreased by 
the quantity ordered, and the order is considered to be filled at blocks 908, 912, 
and 914. If the inventory amount is not greater than the quantity ordered, the 
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order is considered to be backordered. In either case, the order is entered into 
the orders table at processing block 916. » ■ 

Once the order is processed at DbA, the order processing is replicated at 
5 remote sites by entering the replicated procedure in the replication tables, and 
subsequently executed the replicated procedure at the remote sites. At 
processing blogk 918, the information associated with the replicated procedure 
is stored in the replication tables by invoking Queue_transactional_DRPC. As 
described earlier, Queue_transactionalJDRPC stores the replicated 
10 information in the tables. Processing block 918 further illustrates some of the 
information that is stored in the replication tables. 

Figure 10 illustrates the state of the replication tables after 
Queue_transactional_DRPC has processed the replicated procedure's 
15 information. The transactions table is populated with a transaction identifier 
to uniquely identify the order processing transaction, as well as the 
transaction's DON, commit time, and deferring user. The transaction nodes 
table has one entry for the remote copy of the inventory and orders tables 
located in DbB. 

20 

The calls table contains an entry to identify the place_order_remote 
procedure for this order processing transaction (i.e., 4), and the parameter 
count is set to five. The parameters field in the calls table contains the 
parameter information (i.e., item ordered, originating site, customer, quantity 
25 ordered, and the order's status in the DbA database). The call nodes table 

contains an entry to identify the node at which the place_order_remote DRPC 
is to be executed. 
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Figure 9B provides an example of the remote processing of the place 
order process at DbB. Except for the change in the data copy being modified, 
the processing blocks 902 through 916 are the same as the same blocks in Figure 
9 A. As stated previously, procedure-level replication provides the ability to 
5 allow an application to handle exceptions that occur as a result of the 

replication of the procedure at other sites. In the present example, a check is 
made to determine whether there is a discrepancy in the ordered item's 
inventory count in DbA and DbB. One way of determining this is by 
determining whether the order could be filled based on the QuantityOnHand 
10 information in both databases. Thus, at decision block 920 in Figure 9B <i.e., 
"status = filled?"), the order's status at DbB is checked against the order's status 
at DbA. If they are not equal, an exception can be raised for later review. 

»• * 
Instead of raising an exception for later review, other methods of 
15 addressing this discrepancy can be included in the procedure. In the present 
example, the procedure could have been designed to either modify the order at 
DbA, or at DbB. In any case, the present invention provides the ability to allow 
an application to process exceptions using procedure-level replication. 
Whether or not a discrepancy is detected at decision block 920, processing ends 
20 at 924. 

Deferred Remote Procedure Call Initiation 

As previously indicated, once a DRPC has been placed in the replication 
25 tables, the present invention provides the ability to subsequently execute the 
DRPC at a remote site. Figure 11 A illustrates a process for initiating deferred, 
transactional DRPCs contained in the replication tables. 
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The selection of transactional DRPCs can be performed using a number 
of different criteria. For example, they can be selected based on the transaction 
identifier, transaction destination, or a combination of the two- Whatever the 
selection criteria, the transactions to be processed are selected at processing 
5 block 1102* Select transactions are ordered for execution according tQ the DON 
field in the transaction table entries. At decision block 1104 (i.e., "all selected 

transactions processed?"), if all of the selected transactions have been executed, 

i 

processing ends at 1132. . 1 

10 If there are remaining transactions, the next transaction is obtained, and 

its identifier is stored in current_trans_id at processing block 1106. Hie 
modification operations performed in a transaction can be undone before they 
are committed. At processing block 1108, a point (i.e., savepoint) is established 
to identify the state of the data prior to the modifications of the current 

15 transactions. 

At decision block 1110 (i.e., "all calls processed?"), if all of the calls in the 
current transaction have been processed, processing continues at processing 
block 1112. At block 1112, the entry in the transaction nodes table that 

20 corresponds to the transaction processed and the remote copy modified are 

deleted from the transaction nodes table. At decision block 1114 (i.e., "anymore 
remote copies to apply DRPC to?"), if there is some need to retain the current 
transaction's entry in the replication tables, processing continues at processing 
block 1118. If there is no need to retain the entry in the replication tables, the 

25 entry is deleted from the replications table at processing block 1116. At 

processing block 1118, the current transaction's modifications are committed 
making its modifications permanent. Processing continues, at decision block 
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1104 (i.e., all selected transactions processed?"), to perform the remaining 
transactions. • • 

If at decision block 1110 (i.e., "all calls processed?"), if all pf the DRPCs 
5 have not been executed in the current transaction, processing continues at 
processing block 1120. At processing block 1120, a call nodes table entry ior the 
destination node currently being processed. Processing block 1121 identifies a 
calls table entry for the call identified in processing block 1120. 

10 At processing block 1122, the DRPC call string is reconstructed. A DRPC 

call string is one technique for identifying the DRPC to be executed. Other 
means for identifying the DRPC can be used. For example, a DRPC can be 
identified by an internal representation of the call string. In this case, the 
DRPC can be executed by an optimized low level system interface using the 

15 internal representation. Thus, the call can be invoked without reconstructing 
an DRPC call in its entirety. 

If the DRPC is identified and executed using a conventional DRPC call 
string, the reconstructed call string for the place_order_remote procedure call 
20 in the calls table in Figure 10 is: place_order_remote (Widget, DbA, 10,^0, 

filled). If the remote site is included in the call, the call is: place_order_remote 
@DbB (Widget, DbA, 10, 50, filled). 

Referring to Figure 11A, at processing block 1122, a DRPC call string : 
25 reconstructed using the procedure identifier and parms fields from the caiis 
table. The parameters for a call are reconstruct using the parms field in the 
calls table. Figure 13 illustrates a process flow for parsing the parms field. At 
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processing block 1302, a byte is extracted from the parms field. This byte 

i 

represents the type of the current parameter in the parms field. # 

At decision block 1304 (i.e., "Type = 0?"), if the byte extracted from the 
5 parms field is equal to a termination value, processing ends at block 1306. If 
the byte is not a terminating value, processing continues at processing block 
1308 to set "Len" to the next to bytes of the parms field. At processing block 
1310, "Value" is set to the next "Len" bytes from the parms field. At prpcessing 
block 1312, the contents of "Value" and "Len" are passed to a call constructor 
10 facility to incorporate this parameter information in the reconstructed call, 
Processing continues at block 1302 to process any remaining parameters in the 
parms field. 

Referring to Figure 11 A, at processing block 1124, the procedure is 
15 executed at the remote site. At decision block 1126 (i.e., "successful 

execution?"), if the procedure was successfully executed, processing continues 
at processing block 1127 to delete the selected call nodes table entry. Processing 
continues at decision block 1110 to check for additional calls to process. 

20 If the procedure's execution was unsuccessful, the modifications made 

since the savepoint previously created are undone at processing block 1128. At 
processing block 1130, the Exceptions is invoked to create an entry in the 
Exceptions table to retain information concerning the unsuccessful completion 
of the procedure. The exception table can be stored at any location (e.g., 

25 originating, destination, or both sites). In the preferred embodiment, the 
exceptions table is stored at the destination site. Processing continues at 
decision block 1104 with any remaining transactions. 
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— Exceptioh Process Flow — . . 

An exception can be stored in a relation that is located on either thie 
originating or destination sites, or both. Figure 12 illustrates a process flow for 
5 storing an exception. At block 1202, entries are created in the error and 

transaction tables in the destination replication tables based on the values in 
the, originating site's tables. At processing block 1204, entries are created in the 
calls and call nodes table in the destination replication tables based on the 
values in the originating site's tables. At block 1206, processing ends. 

10 

Figure 11A illustrated a method for executing transactional DRPC 
entries contained in the replication tables. Other methods can be used to 
, process the replication table entries using replication capabilities of the present 
invention. For example, the present invention provides the ability to execute 
15 non-transactional DRPC entries contained in the replication tables. Figure 11B 

illustrates a method for executing non-transaction DRPC entries. 

« 

At processing block 1152, the non-transactional DRPCs to be executed are 
selected. At decision block 1154 (i.e., "all selected calls processed?"), if all of the 

20 DRPCs selected have been processed, processing ends at block 1174. If there are 
calls remaining, processing continues at processing block 1156 to select the next 
call and identify it by its call identifier. As in Figure 11A, the processing of a 
non-transactional DRPC in Figure 11B includes establishing asavepoint <at 
block 1158), constructing an DRPC call (at block 1160), and executing the DRPC 

25 at a remote site (at block 1162). If the execution is not successful, the changes 
since the savepoint are undone (at block 1172) and an exception is raised (at 
block 1174). If the execution is successful and no exceptions are raised during 
the execution, the record for this destination is deleted from the call nodes 
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table (at block 1168), and processing continues at decision block 1169. At 
decision blpck 1169 (i.e., "additional destinations for call?"), if there are 
additional destinations for the current call, processing continues at decision 
block 1154 to processing any remaining calls. If there are no additional 
5 destinations for call, the modifications are cbmmitted at block 1170. 

Conflicts 

The present invention provides the ability to identify conflicting 
10 modifications. For example, updates that have occurred to a remote copy of 
the data may be lost, if the replicated modification overwrites the current 
values in the remote copy. Thus, it is important to detect any conflicts. 
Further, if a conflict is detected, the present invention provides the ability to 
communicate an exception, to rollback any changes to a data copy after an 
15 exception is detected, and to incorporate exception handling in an application 
program. Exceptions and conflict information can be stored at the originating 
site, the destination site, or both. 

- Conflict Detection - 

20 

As stated previously, a row-level replicated deferred remote procedure 
call has both the old and new values as part of its parameters. Thus, a 
potential conflict can be detected by comparing the old values of the row at the 
original updating site with the current value of the row at the targeted site for 
25 the replicated modification. If the values are different, a conflict exists between 
the local and remote data copies. 
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As previously illustrated the present invention provides the ability for 
an application, such as the order processing ^example described herein, to 
include error detection. As illustrated in the order processing example, the 
place_order_rempte process includes examination of the loc^l and remote 
5 copies of the Order status. Thus, the present invention provides the ability for 

an application to identify conflicts within one of its procedures. 

* ... , 

■ * 
— Treatment of Conflicts — 

10 When a conflict is detected, information regarding the conflict can be 

identified and stored in the exceptions table (see Figure 3). The fields of the 
exception table provide an error code and a descriptive error string. In 
addition, the exceptions tables provides keys into other replications tables. 
This provides the ability to access the information stored in the replication 

15 tables associated with a DRPC in which the exception is raised. 

For example, as a key into the transactions table, the exceptions table can 
contain a transaction identifier that corresponds to the current transaction 
being processed. To access the related entry in the calls table, the exceptions 
20 table further contains the call identifier of the current calls table entry being 
processed and the destination node (i.e., remote copy location). The calls table 
can be accessed using a procedure's call_id. 

In addition to retaining information regarding a conflict, the present 
25 invention provides the ability to undo a deferred transaction such that all 
updates deferred by the original transaction are undone. The execution of 
deferred calls is contingent upon successful commit of a deferring transaction. 
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If the deferring transaction is rolled back, the deferred calls 1 queue encoded in 
the replication is rolled back. 

The present invention further provides the ability to incorporate error 
5 handling in an application. Thus, an error can be handled as soon as an error 
is detected, or deferred for later processing. The exceptions table provides the 
ability to address any errors after an application's normal processing. Hie 

subsequent error processing can be done with various degrees of operator 

♦ 

intervention and automation. The present invention provides the flexibility 
10 to allow an application to address the type of error handling. • 

♦ 

Multiple conflict routines can be supplied to be used to resolve a conflict 
when it arises. They can be called in order until one of them returns a 
successful return value. If none of the resolution routines are successful, the 
15 exception is retained as an exception. 



Thus, a method and apparatus for data replication has been provided. 
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1. A method of replicating data modifications in one or more 
computer systems comprising the steps of: 

5 storing, in a first computer system, a first copy of a data item; 

storing, in a second computer system, a second <:opy of a data item; 
performing a first modification to said* first copy of said data item; 
storing information about said modification such that said modification 
can be applied to said second copy of said data item; and 
10 using said information about said first modification to perform a sfecond 

modification to said second copy of said data item. 

• 

2. The method of claim 1 further including the step of identifying 
any exceptional occurrence while performing said first modification or said 

15 second modification. 



3. The method of claim 2 further wherein said exceptional 
occurrence is a conflict identified when performing said second modification. 
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4. The method of claim 1 wherein said step of storing said 
information about said modification in said relations further includes the 
steps of: 

i 

retaining information regarding logical unit of work in a transactions 
5 relation; 1 

retaining information regarding the destinations at which said logical 
unit of work is^ to be performed in a transaction nodes relation; , 

retaining information regarding procedures for replicating said logical 
unit of work; 

10 retaining information regarding one or more destinations at which said 

procedures are to be performed; and 

retaining information regarding exceptional conditions raised during 
said first and second modifications in an exceptions relation. 

15 5. The method of claim 4 wherein said step of retaining information 

in said transactions relation further comprises the steps of: 

providing a logical unit identifier in said transactions relation such that 
said logical unit identifier uniquely identifies each occurrence of said logical 
unit in said transactions relation; 
20 providing an ordering value for each occurrence in said transactions 

relation such that said ordering value identifies the order in which 
modifications associated with each occurrence were applied to said data item; 

providing a time value for each occurrence in said transactions relation 
such that said time value identifies the time at which modifications associated 
25 with each occurrence were applied to said data item; and 

providing a user identifier for each occurrence in said transactions 
relation, said user identifier identifies the user invoking the modifications 
applied to said data item. 
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6. The method of claim 4 whetein said step of retaining information 
in said transaction nodes relation further comprises the steps of: 

providing ^ logical unit identifier in, said transaction npdes relation, said 
5 logical unit identifier identifies each occurrence of said logical unit in said 
transaction nodes relation; and 

providing a destination identifier in said transaction nodes relation, said 
destination identifier uniquely identifies each location of said data item to be 
modified. 

10 

7. The method of claim 4 wherein a transaction identifier and 
destination identifier pair uniquely identifies each occurrence in said 
transaction no4es relation. 

15 8. The method of claim 4 wherein said step of retaining information 

regarding procedures further comprises the steps of: 

storing said information regarding procedures in a calls relation; 

providing a calls relation entry identifier in said <:alls relation that 
uniquely identifies an entry in said calls relation; 
20 providing a logical unit identifier in said calls relation that uniquely 

identifies each occurrence of said logical unit in said calls relation; 

providing a procedure identifier associated with each occurrence in said 
calls relation that identifies a logical series of steps for modifying said data 
item; and 

25 providing a parameter count value for each occurrence in said calls 

relation such that said parameter count value identifies the informational 
items used to complete said logical series of steps. 
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9. The method of claim 8 further including the step of: 
providing parameter information that, identifies a plurality of 
arguments for said each of occurrence. 

. . . • 

5 10. The method of claim 9 wherein said parameter information is 

, stored in said calls relation. 

i 

11. The method of claim 9 wherein said parameter informatipn 'is 
stored in a parameters table. 

10 

12. The method of claim 11 wherein said parameters table further 
includes said call relation entry identifier and said logical unit identifier. 

13. The method of claim 4 wherein said step of retaining information 
15 in said calls relation further comprises the steps of: 

providing a calls relation entry identifier in a parameters relation, said 
calls relation entry identifier associates each entry in said parameters relation 
with a logical series of steps; and 

providing a parameter ordering value, said parameter ordering value 
20 identifies the order in which each entry associated with one of said logical 

series of steps are listed in a string identifying said one of said logical series of 
steps. 
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14. The method of claim 4 wherein said step of retaining information 
in said exceptions relation further comprises the steps of: 

providing a logical unit identifier in said exceptions relation, said logical 
unit identifier identifies each occurrence of said logical unit in*said exceptions 
5 relation- 
providing a destination identifier in said exceptions relation, said . 

destination identifier uniquely identifies eafch location of said <iata item to be 

♦ 

modified; 

providing a code value to identify an. exceptional occurrence! while 

♦ 

10 performing said first modification or said second modification; and 

providing a descriptive value to identify said exceptional occurrence 
while performing said first modification or said second modification. 

15. The method of claim 4 wherein said step of retaining information 
15 regarding said one or more procedure destinations further including the steps 

of: 

♦ 

storing a mapping identifier in said relations to identify a mechanism 
for mapping said procedures to said one or more procedure destinations. 

20 16. The method of claim 15 wherein said mapping mechanism is a 

call nodes relation, said call nodes table uniquely identifies an occurrence of a 
procedure in said calls relation and a destination at which said occurrence of a 
procedure is to be performed. 
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17. A method of replicating a value-oriented change to copies of data 
in one or more computer systems comprising the steps of: 

♦ 

providing a first computer system containing a first copy of said data; 
providing a second computer system containing a second copy of said 
5 data; and 

associating a procedure with said first copy of said data such that said 
procedure is invoked when a modification is made to said first copy of said 
data, and said" procedure causes information associated with said modification 
to be stored in relations. 

10 

18. The method of claim 17 further including the steps of: ' 
accessing said information associated with said modification; and 
applying said modification to said second copy of said data. 

15 19. The method of claim 17 wherein said modification is an update 

and said application of said modification to said second copy of said data 
consists of replacing the current value of said second copy of said data with a 
new value of said first copy of said data. 

20 20. The method of claim 17 wherein said modification is an update 

and said application of said modification to said second copy of said data 
consists of: 

comparing an old value of said first data with a new value of said first 
copy said data to identify changed data in said first copy of said data; 
25 replacing a current value of said changed data in said second copy of said 

data with a new value of said first copy of said data. 
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21. The method of claim 17 wherein said modification is a delete and 
said application of said modification to said secdnd copy of said data consists bf 
deleting said second copy of said data, 

* 

5 22. The method of claim 17 wherein said modification is an insert and 

said application of said modification to said second copy of said data consists of 
creating said second copy of said data, said second copy of said data having the 
same value as said first copy of said data. 

10 23 The method of claim 17 further including the steps of: 

identifying any exceptional occurrence while performing said 
modification; and 

retaining information regarding said any exceptional occurrence in said 
relations. 

15 

24. The method of claim 23 wherein the step of identifying said any 
exceptional occurrence further comprises the steps of: 

comparing an old value of said first copy of said data with a new value 
of said first copy of said data to identify changed data; 
20 comparing an old value of said changed data with a current value x>f 

said changed data in said second copy of said data; and 

identifying an exception when said old value is not equal to "said current 

value. 
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25. The method of claim 23 wherein the step of identifying said any 
exceptional occurrence further comprises the steps of : 1 

comparing an old value of said first data with a current value of said 

i 

second data; and 

5 identifying an exception when said old value is not equkl to said current 

value. 

26. A method of replicating a logicroriented modification to copies of 
data in one or more computer systems comprising the steps of: 

10 providing a first computer system containing a first copy of said data; 

providing a second computer system containing a second copy of said 

data; 

modifying said first copy of said data with a procedure containing a 
series of steps for modifying said data; and 
15 retaining information about said procedure in relations- 

27. The method of claim 26 further including the steps of: 
accessing said information about said procedure; and 
executing said procedure to modify said second copy of said data. 

20 

28. The method of claim 26 further including the step of identifying 
any exceptional occurrence while performing said procedure. 



25 



29. The method of claim 28 wherein said step of identifying said any 
exceptional occurrence is incorporated in said procedure such that said 
procedure defines the circumstances that raise said exceptional occurrences. 
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30. The method of claim 28 further including the step of addressing 
said any exceptional occurrence. 1 1 

31. The method of claim 30 wherein said step of addressing said any 
5 exceptional occurrence is incorporated in said procedure such that said 

procedure handles said any exceptional occurrence. 

32. The method of claim 30 wherein said step of addressing said any 
exceptional occurrence is done by said procedure during execution of said 

10 procedure, and said information associated with said any exceptional 

occurrence is re :ned in said relations to be addressed after said execution of 
said procedure. 

33. A method of replicating a value-oriented change to copies of data 
15 in one or more computer systems comprising the steps of: 

providing a first computer system containing a first copy of said data; 
providing a second computer system containing a second copy of said 
data; and 

associating a plurality o: procedures with said first copy of -said data and 
20 said second copy of said data; 

invoking one of said plurality of procedures when a modification is 
made to an associated copy of said data; 

determining whether said modification is of said first -copy of said data; 

and 

25 storing information associated with said modification when said 

modification is of said first copy of said data. 
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34. The method of claim 33 wherein said step of determining 
whether said modification is of said first copy of said data further cdmprises 
the steps of: 

setting a replication flag set to true when said modification is made to 
5 said first copy of said data; • , 

storing information associated with said modification when sai4 
replication flag is false; and 

disregarding said information associated with said replication flag is 

true. 

10 

35. The method of claim 33 wherein said step of determining 
whether said modification is of said first copy of said data further comprises 
the steps of: 

associating a first user identifier with said modification when said 
15 modification is of said first copy of said data- 
associating a second user identifier with said modification when said 
modification is of said second copy of said data; 

storing information associated with said modification when a user 
identifier associated with said modification is said first user identifier; and 
20 disregarding said information associated with said modification when 

said user identifier associated with said modification is said second user 
identifier. 
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